Annotating Gene Functions by Spectral Clustering for Combining Gene Expressions and Sequences

نویسندگان

  • Limin Li
  • Motoki Shiga
  • Wai-ki Ching
  • Hiroshi Mamitsuka
چکیده

Annotating gene functions is a fundamental issue in the post-genomic era. A typical procedure for this issue is first clustering genes and then assigning functions of unknown genes by using known genes in the same cluster. A lot of genomic information are available for this issue, but two major types of data which can be measured for any genes are microarray expressions and sequences, both of which however have their own flaws. Thus a natural and promising approach for gene annotation is to combine these two data sources. We developed an efficient gene annotation method with three steps containing spectral clustering over the integrated clustering cost for each data source. We examined the performance of our proposed method from viewpoints of clustering and annotations. All experimental results indicate our performance advantage over possible clustering/classification-based approaches of gene function annotation, using expressions and/or sequences.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Annotating gene functions with integrative spectral clustering on microarray expressions and sequences.

Annotating genes is a fundamental issue in the post-genomic era. A typical procedure for this issue is first clustering genes by their features and then assigning functions of unknown genes by using known genes in the same cluster. A lot of genomic information are available for this issue, but two major types of data which can be measured for any gene are microarray expressions and sequences, b...

متن کامل

Clustering of a Number of Genes Affecting in Milk Production using Information Theory and Mutual Information

Information theory is a branch of mathematics. Information theory is used in genetic and bioinformatics analyses and can be used for many analyses related to the biological structures and sequences. Bio-computational grouping of genes facilitates genetic analysis, sequencing and structural-based analyses. In this study, after retrieving gene and exon DNA sequences affecting milk yield in dairy ...

متن کامل

Spectral Preprocessing for Clustering Time-Series Gene Expressions

Based on gene expression profiles, genes can be partitioned into clusters, which might be associated with biological processes or functions, for example, cell cycle, circadian rhythm, and so forth. This paper proposes a novel clustering preprocessing strategy which combines clustering with spectral estimation techniques so that the time information present in time series gene expressions is ful...

متن کامل

خوشه‌بندی داده‌های بیان‌ژنی توسط عدم تشابه جنگل تصادفی

Background: The clustering of gene expression data plays an important role in the diagnosis and treatment of cancer. These kinds of data are typically involve in a large number of variables (genes), in comparison with number of samples (patients). Many clustering methods have been built based on the dissimilarity among observations that are calculated by a distance function. As increa...

متن کامل

Construction of expressing vectors including melanoma differentiation-associated gene-7 (mda-7) fused with the RGD sequences for better tumor targeting

Objective(s): Up to now, many researches have been performed to improve the antitumoral effect of melanoma differentiation-associated gene-7 (mda-7) protein. The purpose of our research was to construct 3 expression vectors producing mda-7 in fusion with RGD (Arginine-Glycine-Aspartic acid) peptide and evaluate their expression.     Materials and Methods: mda-7 gene with two different RGD sequ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009